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Abstract 

This paper considers the average consensus problem on a network of digital links, and proposes a set of algorithms based 
on pairwise "gossip" communications and updates. We study the convergence properties of such algorithms with the goal 
of answering two design questions, arising from the literature: whether the agents should encode their communication by a 
• deterministic or a randomized quantizer, and whether they should use, and how, exact information regarding their own states 
' in the update. 
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^■i. . 1 Introduction 

p ; 

■ In the latest years, algorithms to solve consensus problems have attracted a lot of interest. In a consensus problem 
' a group of agents has to agree about a certain quantity, starting from different initial estimates. A special interest 

ON . is devoted to average consensus, where the agents are requested to agree on the average of their initial estimates. 

■ Among the vast literature, we refer the reader to [14] and references therein. The difficulty of the problem resides 
in the communication constraints which are given to the agents. Such communication constraints are usually rep- 
resented by a graph: nodes are agents and edges are available communication links. Moreover, the communication 
across the links can be assumed to be perfect, or rather be digital and possibly subject to bandwidth constraints, 

J-j | interferences, erasures, packet losses, noise, delays. Among the many algorithms for consensus proposed in the litera- 
ture, particularly interesting is the so called gossip algorithm: at every time instant a randomly chosen pair of agents 
communicates and they average their states. Such algorithm, studied in detail in [2], has many appealing features: 
it reduces the number of communications with respect to deterministic algorithms and avoids data collision. The 
present paper is devoted to the adaptation of the gossip algorithm to a network of digital lossless channels, that is, 
subject to quantized communication. 



Related works 



The constraint of quantization, due to the use of digital channels or to computing and memory constraints, has 
been considered in consensus problems in several recent papers [17,4,8,9,3,1,7,13,18]. The present work differs from 
previous ones on many respects. With the notable exception of [9], and of the recent conference [18], quantization 
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has not been investigated in the context of gossip algorithm. On the other hand, the work [9], which has been an 
important reference to us, deals with a strictly related but different problem, that is, consensus of agents having 
quantized states, while our interest is in quantized communication among real-valued agents. The case of gossip 
algorithm is worth of consideration in the context of quantized communication, since it can not be reduced to the 
case of time-invariant communication. For instance, gossiping introduces randomness in the algorithm. This allows 
using probabilistic tools in the analysis, and can significantly improve the convergence properties (in a probabilistic 
sense). Indeed, it prevents the onset of periodic dynamics, as those noticed in the fixed-topology communication 
since [4]. 

One could believe that the finite-bandwidth communication constraint can be easily dealt with using incremental and 
adaptive quantizers, in which the finite-length messages encode the last update rather than the agents state. Actually, 
such approach to consensus algorithm with quantized communication has been undertaken in [3] , assuming a static 
communication network and logarithmic or zooming- in/zooming-out quantizers. However, in a gossip scenario the 
random time-dependence of the active communication links makes difficult to design a similar algorithm: hence the 
analysis of static quantizers is valuable. For these reasons, in this paper we shall consider static uniform quantizers. 



Statement of contributions 

The goal of this paper is to analyze the effects of quantized communication on the gossip algorithm [2], or, equivalently, 
the opportunity of using a gossip communication when quantization of the messages is imposed. The agents' states 
are assumed to be real numbers, while the transmitted messages belong to a finite set. Through both analytical 
results and simulations, we investigate two design questions: whether the agents should use the deterministic or 
the probabilistic quantizer, and whether they should use, and how, exact information regarding their own states 
in the update. This is done in the following way: on one hand, we compare a deterministic uniform quantizer and 
a probabilistic uniform quantizer; on the other hand, we consider three different update rules (partially quantized, 
totally quantized, compensating), which differ in how the agents use the information about their own state. Both the 
quantizers and the update rules are introduced in Section 2. 

Our results, which describe the limit behavior of the algorithms, are obtained by two different techniques. For the 
compensating rule with probabilistic quantization, we give a mean squared error analysis and convergence is proved 
as time goes to infinity. In all the other cases, we study a suitable Markov chain symbolic dynamics, obtaining results 
of convergence in finite time. Such a fact is remarkable, since it underlines the discrete nature of the problem, in 
spite of the state space being continuous. In more detail, we can summarize our results as follows. In Section 4, we 
show that the totally quantized rule ensures that the consensus is reached almost surely in a finite time, both using 
the deterministic quantizer and the probabilistic quantizer. The drawback of this update rule is that it does not 
preserve the average of the initial conditions, and the deviation from the initial average happens to be possibly large 
if the deterministic quantizer is used. On the other hand, the compensating rule preserves the initial average at each 
iteration of the algorithm, but does not guarantee that consensus is reached. However, we prove in Section 3 that 
the states get as close to average consensus as the size of quantization step, in a finite time. Finally, in Section 5 
we consider the partially quantized rule. While it does not preserve the initial average, it has good convergence 
properties. Indeed, if the deterministic quantizer is used, the states get in a finite time as close to consensus as the 
size of the quantization step. If the probabilistic quantizer is used, we can argue a stronger result of asymptotical 
convergence to consensus, and the expectation of the consensus value is the average of the initial states. 



2 Problem statement 

We start recalling the gossip average consensus algorithm: at every time step, a randomly chosen pair of agents 
communicates, and they average their states. This algorithm, brought to wide audience by [2], has many appealing 
features: it reduces the number of communications with respect to deterministic algorithms and avoids data collisions. 
Let us describe such algorithm in more detail. Assume we are given an undirected connected graph Q = (V, £), with 
V = {1, . . . , N} and £ C {(i, j) : i,j G V}. Each of the nodes of the graph is referred to as an agent, and endowed 
with a state, which is scalar function of time, Xi(t), for i£V. The values Xi(0) are given. At each time step t £ Z>o, 
one edge (i,j) is randomly selected in £ with positive probability W^' such that J2u j) g £ W^'^ = 1. Let W be 
the matrix with entries Wij = W^'^. The two agents connected by the selected edge average their states according 
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to 

Xi(t+1) = ^Xi(t) + -Xj(t) 

*j(*+l) = + (1) 

while 

x h (t+l) = x h (t) i£h^i,j. (2) 



Let Eij = (e-i — e 3 )(e, — ancTH 



1 



P(t) = I - -E^ (3) 



where ej = [0, . . . , 0, 1, 0, . . . , 0]* is a N X 1 unit vector with the i-th component equal to 1. Then (1) and (2) can be 
written in a vector form as 

x(t+l) = P(t)x(t), (4) 

where x(t) = [xi(t), . . . , X]y(t)]*. Note that P(t) is a symmetric doubly stochastic matrix, and then (4) preserves 
the average of states. It is known [2,15] that, if the graph Q is connected and each edge S £ is selected with a 

strictly positive probability, then, for every initial condition x(0), the algorithm (4) almost surely reaches the average 
consensus. That is, almost surely 

lim x(t) = X aV o(0)l, 
t — >oo 

where 1 is the column vector whose entries are 1, and for t E Z>o, we define x avc (t) = N" 1 ^2^ =0 Xi(t). If instead 
the weaker condition holds that limt_, 00 x(t) = £1, for some ^ £ 1, we say that the algorithm reaches consensus. 

Note that the gossip algorithm (4) relics upon a crucial assumption: each agent transmits to its neighboring agents 
the precise value of its state. Instead, in this paper we assume that the communication network is constituted 
of digital links. This prevents the agents from having a precise knowledge about the state of the other agents. 
Indeed, through a digital channel, the i-th agent can only send to its neighbors symbolic data: using this data, the 
neighbors of the i-th agent can build an estimate of the i-th agent's state. We denote this estimate by Xi(t), and 
let x(t) = . . . ,ijv(i)]* ■ In this paper, the estimate is simply the received symbol, computed via a suitable 

quantizer, that is an application mapping real numbers into a discrete set. 

2.1 Quantizers 

The first design issue is how to quantize the states, that is how to map the continuous space of states into a discrete 
alphabet of messages. Given X C R", we call quantizer a map q : X — > S, where S is a finite or countable set, 
endowed with the discrete topology. If we have a vector x € X N , with a slight abuse of notation, we will use the 
notation q{x) € S N to denote the vector such that q(x)i = q(xi). Many quantizers have been proposed in the vast 
literature on the subject [12]: here we concentrate on uniform quantizers with a countable alphabet, which can be 
thought as maps q : M — ► Z, up to a suitable rescaling. Two of such quantizers are of special interest to us, which 
we define below, and we call the deterministic and the probabilistic quantizer. A broader discussion is given in the 
final section. The deterministic quantizer is defined as follows. Let q^ : R — > Z be the map which sends z £ IR into 
its nearest integer, namely, 

z e [n-l/2,n+l/2[, if z > 

q d {z) =neZ <=^ 

z e ]n- l/2,n+ 1/2], if z < 0. 
This map enjoys the property that, for all z £ IR, it holds \z — q<i(z)\ < ^> 



The symbol M* is used to denote the conjugate transpose of the matrix M. 
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The probabilistic quantizer q p : R — > Z is defined as follows^] For any x S R, the image <7 p (x) is a random variable 
on Z defined by 

[xj w hh probability |~x] — x 
q p [x) = < (5) 
[ |x| with probability x — [x\ . 

In this case, for all z G R, it holds that \z — q P {z)\ < 1. The following lemma states two important properties of the 
probabilistic quantizer. 

Lemma 2.1 For every x £ M, it holds that 

E[q p (x)} = x, 
E (x - q p {x)) 2 



1 

< -. 

- 4 



Proof: The first equation is immediate, and the second one follows from computing 

E (x — q p (x)) 2 = x[x\ + x\x] — [x\ \x] — x 2 < —. 



□ 



The quantizers defined above map R into Z, and have quantization bins of length 1. More general uniform quantizers, 
having as quantization step a positive real number e, can be obtained from q : R — > Z by defining q^^x) = eq(x/e). 
Hence, the general case can be simply recovered by a suitable scaling, and our choice is not restrictive. 

2.2 Update rules 

We introduce three updating rules for the states, which require quantized communication. To describe them, let us 
assume that be the edge selected at the t-th iteration. 

a) (Totally quantized) In the first strategy, i and j, in order to update their states, use only their estimates, as follows, 

Xi(t + 1) = ^Xi(t) + ljXj(t) 

x j (t+l) = ±x j (t) + ±x i {t), (6) 

or, equivalently in vector form, by recalling the definition of P(t) given in (3), 

x(t + 1) = P(t)x(t). (7) 

b) (Partially quantized) If instead the agents have access to their real- valued state, a natural update rule is 

Xi(t+1) = \xi{t) + ^ Xj {t) 

Xj (t+l) = ^x j (t) + ±x i (t), (8) 

or, equivalently in vector form, 

x(t + 1) = i.T(t) + (P(t) - diagP(i))£ (t). (9) 



2 Elsewhere in the literature [1,8,16] this quantizer has been introduced as a result of dithering, that is the addition of a small 
random noise before (deterministic) quantization. 
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c) (Compensating) Both the above update rules do not preserve the average of states, which can be a significant 
drawback in some applications. To cope with this problem, we propose a third update rule, in which agents use 
both their real-valued states and their quantized values. To understand the idea behind this, consider the standard 
gossip update (1), which can be rewritten as 

Xi(t+1) 

xj(t + i) 

We then propose the following updating rule, 

Xi(t + 1) 

Xj (t + 1) 

or, equivalently in vector form, 

x(t + l) = x(t) + (P(t)-I)x(t). (11) 

Note that in facts law (11) preserves the initial state average, as the law (4) does. Formally, defining x ave (t) = 
±l*x(t), we have that x avc (t) = a; avc (0), for all t > 0. Indeed, l*x(t + 1) = l*x(t) + l*(P(t) - I)x(t) = l*x(t), 
where the last equality follows from the fact that, since P(t) is doubly stochastic for all t > 0, then l*(P(t) — I) = 
for all t > 0. The idea of the agents updating their own state using both exact and quantized information about 
their own state at the previous time step, has already been shown useful in the case of quantized consensus on a 
time- independent network [7] . From the point of view of communication theory, this strategy is meant to fully exploit 
the implicit channel feedback which comes from quantization: since communication is quantized, but the channels 
are reliable, each agent knows that its neighbors are going to receive the message it has transmitted. 

In the sequel, we refer to (7) as the totally quantized rule, to (9) as the partially quantized rule, and to (11) as the 
compensating rule. In the following sections, we proceed with a detailed analysis of the dynamical systems induced 
by the above rules. 



3 Compensating update 

We start our analysis from the update rule (11), considering first the case of deterministic, and then of probabilistic 
quantizers. 

3.1 Deterministic quantizer 
Consider the compensating strategy 

x t (t + 1) = x t (t) - -qd(xi{t)) + ^qd(xj{tj) 

Xj(t + 1) = xj(t) - -qaixjit)) + -q d (xi(t)). (12) 

The limit behavior of the above algorithm can be studied exploiting a natural symbolic dynamics interpretation of 
the states dynamics, obtaining results which are reminiscent of those in [9], We define rii(t) = [2xi(i] J for all i 6 V, 
and let n{t) = [rii(i), . . . , njv (£)]*• 

To start, we need the following technical lemma, whose proof can be found in [7]. 



x i{t) ~ \ x i{t) + \ x j{t) 



2 ' w 2 
x A t ) - + 2 



(10) 
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Lemma 3.1 Given a,/3 e N and iff, it holds 

\x\- laX 



q d (x) = [x+ 1/2J = 



"1 


[2(3x\ 
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Lemma 3.1 implies that qd{%i{t)) 



i(t) 



and then we can manipulate the dynamics as follows: 



From this we obtain that 



Xi(t + 1) = Xi(t) - ^q d (xi(t)) + ^qd(xj(t)) 
[2x,(t + 1)J = L2xi(t)J - g d (ii(t)) + <? d (zj(*)), 



n,(t) 



n,(i + 1) = rn(t) - 


~m(ty 

2 


+ 




m (t) 






(t)] 




2 


+ 


2 



We have thus found an iterative system involving only the symbolic signals ni{t). When the edge (i,j) is selected, i 
and j adjourn their states following the pair dynamics 



where </i:ZxZ— > Z X Z is 



(m(t + l),nj(t + 1)) = ,giK(t),n 3 (i)) 
gi(h, k) = 



(13) 



Notice that g\ is symmetric in its arguments, in the sense that if gi(h,k) = (?7,x), then g\{k,h) = (XiV)- Since 
rii(t) = [2xi(t)\, the analysis of the evolution of (13) allows us to obtain information about the asymptotics of Xi(t). 
We have the following result. 

Theorem 3.2 Let n(t) evolve according to (13), and let 

K = jr e Z N :3a e Z s. t. r - al E {0, 1} N \ . (14) 

Then, for every initial condition n(0) € Z, almost surely there exists T con £ Z>o such that n(t) G 1Z for all t > T con . 
Proof: The proof is based on verifying the following three facts: 

(i) the set 1Z, defined in (14), is an invariant subset for the evolution described by (13); 

(ii) n(t) is a Markov process on a finite number of states; 

(iii) starting from any state in Z , there is a positive probability for n(t) to reach a state in 1Z in a finite number of 
steps. 

Let us now check them in order, 
(i) Let h S Z. Observe that 



.91 (h, h + 1) 



(h + 1, h) if h is even 



(h, h + 1) if h is odd. 
This implies that 1Z is an invariant subset for the dynamics described by (13). 



(» 



(ii) Markovianity immediately follows from the fact that subsequent random choices of the edges are independent. We 
prove now that the states are finite. To this aim let (h' , k') = g\(h, k). The form of g\ implies that 



max {h , k'} < max {h, k} 



min {h, k} < min {h! ', k } . 



Define 



m(t) 



min rii 

Ki<N 



and 




(15) 



Then, the above inequalities imply that m(t) > m(0) and M(t) < M(0). This implies that the cardinality of the 
set of the states is upper bounded by (M(0) — m(0) + l) N . 
(iii) Let D(t) = M(t) — m(t). The proof of (iii) is based on the following strong result about the monotonicity of D(t): 
if D(t) > 2, then there exists r G N such that 



Now we prove (16). We define X a (t) = {i 6 V : Tii(t) = a}, and fix a time t G N. We shall prove that the 
cardinality of I m u Q ){t), denoted by \l m ^(t)\, does not increase as a function of time, and that, if D(t) > 2, then 
there is a positive probability that it decreases within a finite number of time steps. Notice first that, for h, k G Z, 
gi (h + 2, fc + 2) = <7i(ft, A;) + 2. Hence, by an appropriate translation of the initial condition, we can always restrict 
ourselves to the case m(t ) G {0, 1} , which is easier to handle. 

Case m(io) = 0. In this case it is possible for a nonzero state to decrease to 0, but only in the case of a swap 
between and 1. This assures that |X m ( to )(t)| is nonincreasing. Let S(t) denote the set of nodes which have value 
m(to) +2 or larger. Since -D(io) > 2 then S (to) is non empty. Now let («i,i>2, . . . ,v p -\,v p ) be a shortest path 
between X m (t ) (t) an d <S(to). Such a path exists since Q is connected. Note that v\ G X m n Q ^(i) and v p G <S(to) and 
that {t>2, ■ ■ • ,v P —x} could be an empty set; in this case a shortest path between X m ( to )(£) and S(to) has length 1. 
Note also that all the nodes in the path, except v\ and v p , have value 1 at time to, otherwise (v\, V2, ■ ■ ■ , v P —x, v p ) 
would not be a shortest path. Since each edge of the communication graph has a positive probability of being 
selected at any time, there is also a positive probability that in the p — 1 time units following to the edges of 
this path are selected sequentially, starting with the edge (vi,v 2 )- At the last step of this sequence we have that 
the values of v p -\ and v p are updated. By observing again, that the pair of values (0, 1) is transformed by (13) 
into the pair (1,0), we have that the value of v p -x, when the edge (v p -\,v p ) is selected, is equal to 0. This 
update, for the form of (13), makes the value of both nodes be strictly greater than 0. Therefore, this proves 
that |Z m ( to )(io + p — 1)| < |T m ( to )(io)| with positive probability. Clearly, if \X m ^ to ^(to)\ = 1 then we have also that 
D(to + p — 1) < D(to) with positive probability. 

Case m(to) = 1. In this case no state can decrease to 1, and thus |T m ( to )(f)| is again nonincreasing. Let Z m ( to )(t), 
S(t) and (vi, v%, . . . , v p -\, v p ) be defined as in the previous case. In this case all the nodes v%, . . . , v p -\ in the path 
have value equal to 2. Moreover observe that also the sequence of edges (v p -i,v p ), (v p -2, fp-i), • • • ,(v2, V3), (v\, V2) 
has positive probability of being selected in the p — 1 time units following to. At the last step of this sequence 
of edges, the values of v\ and v 2 are updated. The value of v\ is equal to 1. Since the value of v p at time to is 
greater or equal to 3, and since the pair (2, 3) is transformed by (13) into (3, 2), we have that the value of i>2 when 
the edge (vx, ^2) is selected, is greater or equal to 3. This update, for (13), causes the value of both nodes to be 
strictly greater than 1. Hence \T m (t )(to +p — 1) < |2 m ( to )(i) with positive probability. Again, if \I m ^ to ^(t)\ = 1 
then we have also that D(to +p — 1) < D(t) with positive probability. 

Consider now the following sequence of times t = to, tx, £2, For each i > 0, let ti+\ be the first time for which 

there is a positive probability that \T m ( t .) (ii+i) < \T m ( t .) (U)\. Let now k G 1>>o be such that |X m ( tfc )(i/ c ) = 1. 
Then we have that D(tk+\) < D(tk) with positive probability. This ensures the validity of (16). 

The proof of the fact (iii) follows directly from (16). Indeed, let n £ 1Z, then, from a repeated application 
of (16) it follows that, there exists a path of the Markov chain connecting n to a state n 1 = [n[, . . . , n' N ], such that 
max {n'j, . . . , n' N } — min {n^, . . . , n' N } < 2, that is, n 1 G 1Z. 



We can go back to the original system, and prove the following result. 

Corollary 3.3 Consider the algorithm (12). Then, almost surely, there exists T con G Z> such that for all t >T C 



F[D(t + r) < D(t)} > 0. 



(16) 



□ 



\xi(t) - Xj (t)\ <1 Vi,j. 



(17) 
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and hence, 

\\x(t) (0)l]|oo < 1. 

Proof: The proof is an immediate consequence of Theorem 3.2 and of the relation ni{t) = |_2xj(t)J , which assure 
that the states belong to two consecutive quantization bins. □ 

Our results about algorithms for quantized communication (here and in following sections) can naturally be related 
to the analysis of gossip consensus algorithms with quantized states. Namely, the important connections with the 
work in [9] are the object of the next two remarks. 

Remark 3.1 It is worth noting that Theorem 3.2 is a variation of Lemma 3 and Theorem 1 in [9], In [9] the authors 
introduced a class of quantized gossip algorithms, satisfying the following assumptions. Let (i,j) be the edge selected 
at time t and let rii(t) and nj(t) the values at time t of node i and of node j respectively. If ni(t) = Tij(t) then 
rii(t + 1) = ni(t) and rij(t + 1) = rij(t). Otherwise, defined = |rij(t) — rij(t)\, the method used to update the 
values has to satisfy the following three properties: 

(PI) m(t + 1) + nj(t + 1) = m{t) + nj(t), 

(P2) if D tj (t) > 1 then D l0 (t + 1) < D tJ (t), and 

(P3) if Dij(t) = 1 and (without loss of generality) m(t) < rijit), then m(t + 1) = rij{t) and rij(t + 1) = m{t). Such 
update is called swap. 

Now we substitute the property (P3) either with the property 

(P3') if D.ij(t) = 1 and (without loss of generality) < nj(t), then, if rij(£) is odd, then n,(t + 1) = rij(t) and 

n,j(t + 1) = rii{t), otherwise if ni(t) is even then m(t + 1) = rii{t) and rij{t + 1) = nj(i) 

or with the property 

(P3") if Dij(t) = 1 and (without loss of generality) rii(t) < rij(t), then, if m(t) is even then ni(t + 1) = %(£) and 
rij{t + 1) = nj(i), otherwise if rij(i) is odd then nj(t + 1) = and 7ij(i + 1) = rij(t). 

If we consider the class of algorithms satisfying (PI), (P2), (P3') or satisfying (PI), (P2), (P3"), it is possible to 
prove that Lemma 3 and Theorem 1 stated in [9] hold true also for this class. The proofs are analogous to that of 
Theorem 3.2 provided above. Moreover it is easy to see that the algorithm (13) satisfies the properties (PI), (P2), 
(P3'). This represents an alternative way to prove Theorem 3.2. 

Remark 3.2 We observe that a sensible algorithm for consensus with deterministically quantized communication 
comes applying the algorithm of Kashyap, Ba§ar and Srikant [9], which we refer to as the KBS algorithm. The 
adaptation we propose is as follows. The initial real values are first quantized, and then the KBS algorithm is 
applied: convergence to consensus is guaranteed up to an error of one, in finite time. Since the worst case error 
committed by quantizing the initial states is 1/2, we conclude that for t large enough \\x(t) — cc aV e(0)l||oo < §• Hence 
the convergence properties of this algorithm, which renounces making any use of the continuity of states, are worse, 
but comparable with those of the algorithm (12). 

3.2 Probabilistic quantizer 

In this section we assume that the information exchanged between the systems is quantized by means of the prob- 
abilistic quantizer q p described in (5), namely Xi(t) = q p (xi(t)). Dealing with updates based on the probabilistic 
quantizer, we make the following natural assumption of statistical independence. 

Assumption 1 For all t £ Z>o, given the values Xi(t) for all i 6 V, the random variables q p {xi{t)), as i varies, 
form an independent set. Moreover, for allt G Z>o, given the value Xi(t) for any i € V, the random variable q p (xi(t)) 
is independent from Xj(t), for every j ^ i. 
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The algorithm for the compensating strategy, when the edge is chosen, can be written as 

Xi(t + 1) = Xi(i) - -q p (xi(t)) + -q P (xj(t)) 

Xj(t + 1) = Xj (t) - \q P (x 3 {t)) + y p (xi(t)). (18) 

The analysis of (18) is more complicated than for the corresponding law (12). This is mainly due to the lack of 
convexity properties which were used in the analysis of (12). The following example shows this type of difficulty. 

Example 1 Consider (12) and assume that the edge (i, j) has been selected at time t. Without loss of generality 
assume that Xi(t) < Xj(t). Then, by convexity arguments, we have that [^(OJ < Xj(t+1), Xj(t+1) < \xj(t)~\. This is 
no longer true for (18). As a numerical example assume that Xi(t) — 3.4 and Xj(t) = 3.6. Then with probability 0.16 
we have that q p (xi(t)) = 4 and q p (xj(t)) = 3. In such a case, by (18), we have that Xi(t+ 1) = 2.9 and Xj(t+1) = 4.1. 
Hence, Xi(t + 1) and Xj(t + 1) do not belong to the interval [[a;j(i)J, [^-(i)]]. 

For this reason, we do not develop a symbolic analysis for this algorithm, and we do not prove convergence in finite 
time. Instead of a symbolic analysis, we provide a mean-square analysis, yielding interesting convergence results. We 
start by observing that (18) can be rewritten as 

x(t + 1) = P(t)x(t) + (P(t) - I) (q p (x(t)) - x(t)) (19) 

Let 

y(t) = ( 7 -^ ir ) *(*)> 

and remark that y(t) = x{t) — -^-ll*x(0). 
Now, from (19), we can write 

/ - 111*) x(t + 1) = (/ - 111*) P(t)x(t) + (l- 111*) (P(t) I) (q p (x(t)) - x(t)) . 

Define the quantization error as 

e(t) = q p (x(t))-x(t). 

Since (/ - ^11*) P(t) = P(t) (I - ^11*) and (/ - ^11*) (P(t) - 1) = P(t) - I, we obtain the following recursive 
relation in terms of the variables e(t) and y(t): 

y(t + 1) = P(t)y(t) + (P(t) - I)e(t). (20) 

In order to perform an asymptotic analysis of (20), it is convenient to introduce the following matrices. Let 

E w (t) - E [y(t)y*(t)} , S ffl (t) = E [e(t)e(t)*] , E„ e (t) = E [y(t)e(t)*] , (21) 

where the expectation is taken with respect to both the randomness due to gossip communication and to quantization. 
Equation (20) leads to the following recursive equation in terms of the above matrices 

Zyy(t + 1) = E lP(t)Z yy (t)P(t)} + E [Pm ye {t)(P{t) - I)] 

+ E [(P(t) - m; e P(t)] + (P(t) - I) £ ee (t) (P(t) - I) . (22) 
The following proposition states some correlation properties of the variables y and e. 

Proposition 3.4 Consider the vector random variables y(t) and e(t) defined above, and the matrices in (21). Then 

E[e(t)]=0 and E ee (i) = diag {af(t), . . . , a%(jt)} 
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where of (t) = E [ef(t)] is such that af(t) < 1/4, for all i E {1, . . . , N} and for allt>0. 
Moreover 

E ye (t) = 0, 

for all t > 0. 

Proof: Using conditional expectation properties, and Lemma 2.1, we have that 



(23) 



E[en{t)]=E[E[q p {xi{t))-Xi(t)\xi 

= E[E[q p (x i (t))\x i (t)]-x i (t)} 
= E[xi(t)-Xi{t)} 
= 0. 



(24) 



Moreover, for i j, using Assumption 1, 



E[e i {t)e j (t)]=E\E[e i {t)e j {t)\x i (t), Xj {t)}} 



E[E [e t (t)\xi(t), x j (t)}E\e 3 {t)\x l (t), x 3 (t)}} 

E[E [e t {t)\x t {t)}E[e 3 it)\x 3 it)]] 





If i = j, using again Lemma 2.1, we have that 



E[ef(i)] =E 



< E 
1 

~ 4 



(<7 P (^W) - Xi(t)Y 



(q p (xi(t)) - Xi(t)) \xi(t) 



An argument similar to the one used to prove that E \ei{t)ej{t)] = allows to prove that E \xi{t)ej{t)] = for any 
i ^ j. This easily yields (23). □ 



From the above properties we have that (22) can be rewritten as 

Z yy (t + 1) = E [P(t)E yy (t)P(t)] + E [(P(t) - I) S ee (t) (P(t) - I)} . 

To estimate the asymptotic distance from the initial average, we introduce the cost function 



J(W) =lun S up\l ±E[\\y(t)P]. 

t->oo V IV 

The cost depends on the selection probabilities W, and, thanks to the above definitions, can be computed as 



(25) 



(26) 



J(W)=limswp^-tx{-E yy (t)}. 



(27) 



We can rewrite the evolution law (25) as 

+ 1) = Af(Eyy(t)) + C(S ee (t)), 

where N and Q are linear operators from M. NxN to itself. Namely, given a matrix M, Af(M) 
Q(M) = E [(P(t) - I) M (P(t) - /)] . 



E [P(t)MP(t)] and 
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It is useful to remark that J\f is actually the evolution on T, yy for the gossip algorithm [2], in the absence of 
quantization error, while Q can be regarded as a disturbance due to the quantization error. From [5], we know 
that in the case of no quantization the system converges almost surely to consensus. This implies that TV is an 
asymptotically stable operator when restricted to the subspace S = {M e M. NxN : 1*M1 = 0}. Since 1*Q(M)1 = 
for any matrix M and E TO (0) £ S, we have that E TO (i) G S for all t > 0. 

Computing J(W) is a quite difficult problem. We then try to simplify the problem by introducing the following 
auxiliary system 

£(< + 1) = E [P(t)Z(t)P(t)] + ^E [(P(t) - I) 2 ] , (28) 
where E(0) = E yy (0), and the following cost function 



t— >oo 



J = limsupW4tr{S(i)}. 



N 



In principle, J should depend on W, too. However, we are going to prove that this is not the case. We have the 
following comparison result. 

Proposition 3.5 Consider the cost functions J(W) and J. We have that 

J(W) < J. 



Proof: To prove the statement we show, by induction on t, that £(t) > H yy (t) for all t > 0, where the inequality 
is meant in matricial sense, that is, — T, yy (t) is a semidefmite positive matrix. 

Since S(0) = 11^(0), the assertion is true for t = 0. Assume now that £(t) > H yy (t) is true for a generic t. Then, 



E(t+1)-E w (t+1) 



= E [P(t)E(t)P(t)] + -E [(P(t) - 7) 2 J - (E [P(<)E w (t)P(t)] + E [(P(t) - I) E ee (t) (P(t) - /)]) 
= E [P(t)(S(t) - S w (*))P(t)] + E [(P(t) - /) - E„(t)] (P(f) - /) 



Since by inductive hypothesis E(t) > E TO (t) and since by Proposition 3.4 we know that E ee (t) < jl for all t > 0, 
we have that E(i + 1) -E TO (i + l) > 0. □ 

Observe now that, since P(t) 2 = P(t), we obtain that E[(7 - P(t)) 2 ] = I - E[P(i)]. From this fact we obtain the 
following result. 

Proposition 3.6 Given the above definitions and (25), 



lim sm = -( I - —IV 



Proof: Define the matrix Q = E[(7 - P(t)) 2 ]. Since Eyy(0) € >S, and A/" is asymptotically stable if restricted to 
the subspace S, then 

+ oo 

lim t(t) = YjV^(Q). 

t— >oo * — ' 



t=0 
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This is the only fixed point of the iteration law (28). Thus we are left to prove that S* = | (I — -^11*) is a fixed 
point, that is E* = 7V(X*) + Q. This is true, because 



7V(£*) + Q = -E 



P(t)(/--ll*)P(i) 



■(/-E[P(t)]) 



j|E[p(i) 2 ] -Iir + j_E[p(t)]| 



4 

4 I JV 



Corollary 3.7 For every weight matrix W, it holds true that J(W) < \. 



Proof: From Proposition 3.6 we can argue that J = iw and since J(VF) < J, the claim follows 



□ 



□ 



From these theorems we draw a strong conclusion about the convergence of the algorithm. In spite of missing 
consensus in the strict sense, the asymptotical mean squared error of the algorithm is smaller than the size of the 
quantization bin, and has a bound which does not depend on the number of the agents, nor on the topology of the 
graph, nor on the probability of the edges selection. 



4 Totally quantized update 

This section is devoted to study the totally quantized update 



Xi (t + 1) = -Ut) + -Xj(t) 



1 



1 



introduced in (6), with both the deterministic and probabilistic quantizers. 
4-1 Deterministic quantizer 

In this subsection we consider the totally quantized strategy with deterministic quantizer 

Xi(t+1) = -qd{xi{t)) + -qd{xj{t)) 



1 



1 



(*+!)= o<Zd(zj(*)) + -q d (xi(t)) 



(29) 



We underline immediately that the update (29) only uses quantized information, and not exact information combined 
with quantized information as the update (12). This makes the analysis of (29) slightly easier than the analysis 
of (12). We show in this subsection that the law (29) drives, almost surely, the systems to consensus at an integer 
value. However, the initial average of states is not preserved in general. Again, the analysis of this algorithm can be 
performed efficiently by means of a symbolic dynamics. Let again ni(t) = \2xi{t)\ for all i € V. From (29) and the 



fact that qd{xi{t)) 



n,(t) 



we obtE 



(rii(t+ l),nj(t+ 1)) = (g 2 (n i (t) 1 n : j(t)) 1 g2(n l (t),n j (t))) 
where 52 : Z x Z — > Z is defined as 

g 2 (h,k) = 



(30) 



~h~ 




"fc" 




+ 




2 


2 
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Define 



A = {y eZ" | 3a G Z such that y = 2al}. 



(31) 



We have the following result. 

Theorem 4.1 Let n(t) evolve according to (30). For every fixed initial condition n(0), almost surely there exists 
Icon G Z>o smc/i that n(t) G .4 /or all t > T con . 

Proof: As for the proof of Theorem 3.2. it is sufficient to verify the following three facts: 

(i) each element in the set A is invariant for the evolution described by (30); 

(ii) n(t) is a Markov process on a finite number of states; 

(iii) starting from any state in Z N , there is a positive probability for n(t) to reach a state in A in a finite number 
of steps. 

Let us now check them in order. 

(i) is trivial. 

(ii) Markovianity immediately follows from the fact that subsequent random choices of the edges are independent. 
To prove that the states are finite, define m(t) and M(t) as in (15). Let p,q S Z with p < q. Then, the form of 
<72 implies that p < gi (p, q) < q + r q where r q denotes the remainder in the division of q by 2. It follows that 



This yields (ii). 

(iii) Let us fix t = to, and assume that n(to) ^ A. We prove that there exists r G N such that P [n(t.Q + r) G .4] > 0. 
We start by observing that, from the assumption of having a connected graph, there exists (h, k) £ £ such that 
nh(to) = m(to), rifc(io) = q and g2{m(to), q) > rn(to). Indeed, two cases are given when n(to) ^ A. 

• If m(io) < M{t n ), then it suffices to consider an edge (h, k) such that n^t®) = m(t ) and rifc(io) = Q > m(to), 
which gives g2{m(to),q) > m(to). Note that such an edge exists from the hypothesis of having a connected 
graph; 

• if m(to) = M(to), necessarily we have that m(to) and M(to) are odd; then <?2(m-(io), rn(to)) > m(to). 

We define now I a {t) = {i G V : n.i(t) = a}. The above discussion implies that \T m ( to j(to + 1)| < \X m t to ^(to)\ 
with the positive probability of choosing the edge (h, k) and hence that there is also a positive probability that 
at some finite time t' > to, \T m ^ to ^{t')\ ~ 0, that is m{t') > m(to). Iterating this argument and recalling that 
M{t) < M(t ) + r M (t ) f° r au * > *o, h follows that there exists r G N such that P [n(t + r) G A] > 0. 

This proves the thesis. □ 

We can now go back to the original system. The following corollary follows immediately from the definition of n(t). 

Corollary 4.2 Let x(t) evolve according to (29). Then almost surely there exists T con G Z>o and a G Z such that 
Xi (t) = a for all i G V and for all t > T con . 

We have already underlined the fact that this strategy does not preserve the initial average, in general. However, 
the convexity argument developed in the above proof, step (ii), implies a worst case bound on the error committed 
by the algorithm, as 



m(0) < Tu(i) < M(0) + r M(0) Vi G V \/t > . 



(32) 



\a - x, 



ave 



(0)1 <- 2a 

Wl - 2 N 



l*n(0) +i ^l*n(0) - 2x avc (0) 




< max Xi (0) — min a;, (0) + — . 



(33) 



The significance of this apparently conservative bound will be discussed in Remark 4.1. 
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4-2 Probabilistic quantizer 



The algorithm for the totally quantized strategy, when the edge is chosen, can be written as 

Xi(t + 1) = ^q P (xi(t)) + ^q P (xj(t)) 

Xj(t + 1) = -q P (xj(t)) + \q P {x % (t))- (34) 

A first remark is that ¥\x avc (t + 1) = %a,ve(t)], that is, the average is preserved in expectation. Below we prove that 
the law (34), as the law (29), drives almost surely the systems to consensus at an integer value. Using a probabilistic 
quantizer in a gossip algorithm, we have to deal with two sorts of randomness, since the interacting pair is randomly 
selected, and the quantization map is itself random. This makes the analysis of (34) more complicated than the 
analysis of (29). However, again, we are able to prove the convergence by a symbolic dynamics approach. 

Let again n^t) = \2x.i(t)\ for all i E V, and let n(t) = [rii(t), . . . ,n N (t)]* . Before finding a recursive equation for 
n(t), we need to introduce the following random variable. Let 

r a n = inf{i | at time t every node in V has been selected at least once}. 

T a n is an integer random variable which is almost surely finite, because nodes are selected with positive probability. 
Note that, from (34), we have that Xi(t) g {a, a + 1/2} for some integer number a, for all t > T a \\. This allows us 
to disregard the evolution before T a n and to analyze, for t > T a n, the symbolic dynamics as follows. For t > T a n, by 
recalling how the probabilistic quantizer works, we have that 

Vi^L if m(t) is even 

Q P (xi(t)) = { with probability 1/2 

if nAt) is odd. 

L^J with probability 1/2 
Let £i and £2 be two independent Bernoulli random variables with parameter 1/2 and define 173 : Z x Z — > Z by 



~h~ 




~k~ 


2 


+ 


2 



where r/, denotes the remainder of the division of h by 2. If, at time t, the edge is selected, then 

(n t (t + l),n 3 (t + l)) = feKW^^WJ.SsKW, %•(*))). (35) 



The following result characterizes the convergence properties of (35). Recall that in (31) we defined the set A = 
{y g Z N I 3a G Z such that y = 2al}. 

Theorem 4.3 Let n(t) evolve according to (35). For every fixed initial condition rt(0), almost surely there exists 
Icon g Z>o such that n(t) £ A for all t > T con . 

Proof: The proof is similar to the proof of Theorem 4.1 and Theorem 3.2, and it is based on proving the 
following three facts: 

(i) each element in the set A is invariant for the evolution described by (35); 

(ii) n(t) is a Markov process on a finite number of states; 

(iii) starting from any state in Z N , there is a positive probability for n(t) to reach a state in A in a finite number 
of steps. 

Let us now check them in order. 

(i) is trivial. 
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(ii) Markovianity immediately follows from the fact that subsequent random choices of the edges are independent 
and from (35). To prove that the states are finite, define m(t) and M(t) as in (15). Let h £ Z. Then, from the 
structure of 173 we have that 

• <73 (h, h) = h if h is even; 

• h — 1 < gs(h, h) < h + 1 if h is odd. 

The above two properties imply that m(0) — r m (o) < n<i(t) < M(0) + r^-(o) f° r all i £ V and for all £ > 0. This 
yields (ii). 

(iii) Observe that 

g 3 (h, k) = g 2 (h, k) - ^r h - £ 2 r k , 
where g 2 is the map defining the evolution of (30). Hence 

¥[g 3 (h,k)=g 2 (h,k)} > - A . 

This fact, combined with the fact (iii) proved along the proof of Theorem 4.1, ensures that, also for (35), there 
is a positive probability of reaching a state in A in a finite time. 

□ 

The above theorem and the previous remarks about T a n lead to the following claim about the original system. 

Corollary 4.4 Let x(t) evolve following (34)- Then almost surely there exists T con £ Z>o and a e Z such that 
Xi(t) = a, for all i £ V and for all t > T con . 

As for (29), this strategy does not preserve the initial average, in general. However, using the probabilistic quantizer 
guarantees that the average is preserved at least in expectation, and moreover, the convexity argument developed 
in the above proof, step (ii), provides a worst case bound on the error committed by the algorithm, as 

\a — 2 a vc(0)| < maxxi(O) — minx;(0) + 2. (36) 

i i 

Remark 4.1 (Deviation from the initial average) Let us discuss the issue of the deviation from the initial 
average in the totally quantized algorithm. The worst case bounds (33) and (36) leave the possibility that the 
amplitude of such deviation depend on the initial condition, but one could expect that the average behavior of the 
algorithm be better than that. In Figure 1 we compare the totally quantized strategy, using q p and qd, in terms of the 
deviation z = \a — N^ 1 l*x(Q)\. We plot z as a function of N, using a sequence of complete graphs of increasing size. 
For our plot we considered complete graphs, because of their faster convergence: however, the results are qualitatively 
the same for other families of graphs (rings, lattices, random geometric graphs). Simulations show that the consensus 
point obtained using q p is close to the average of the initial condition, even if the algorithm preserves the average of 
the states only in expectation. Instead, qd leads to a consensus point whose distance from the average of the initial 
condition is larger, increases with N, and depends on the initial condition. This behavior can be understood as a 
consequence of the accumulation of rounding errors. Indeed, the application of the symbolic dynamics g 2 increases 
the sum of the symbolic states by one, every time an odd argument is involved. Then, the larger the graph, the more 
time steps arc needed for convergence, the more errors accumulate. The randomness due to q pi instead, rules out 
this cffccl|3], and indeed (34) preserves the average in expectation at each time step. 



3 A similar compensation of the accumulation of errors can also be obtained by a deterministic modification of the quantizer 

qd- For instance, let q~d be defined as qd except that, for any integer y, cjdiy + 0.5) ~ \ if y is odd 

I y + 1 if y is even. 

Simulations suggest that such a correction actually reduces the perturbation of the average, but is less effective than random- 
ization. 
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2.5 r 




Fig. 1. Comparison in terms of z between the totally quantized algorithm using deterministic and probabilistic quantizers, for 
complete graphs of size N. The plotted values are the average of 1000 runs. The initial condition were drawn from a uniform 
distribution on the given intervals. The plotted lines are not smooth, due to significant variances. For N = 100, in the four 
cases of the figure the standard deviation is equal to 0.41, 0.081, 0.50, 0.085, respectively. 

5 Partially quantized update 



In this section we study the partially quantized update rule defined in (8) as 

Xi (t+1) = ± Xi (t) + -Xj(t) 

Xj(t+1) = \x j {t) + -x i {t). 



5.1 Deterministic quantizer 



Observe that the update (8) does not preserve the average of states. Nevertheless, if the deterministic quantizer is 
used, such an update induces the same symbolic dynamics g\ as the compensating strategy. Indeed, the continuous 
dynamics is 

Xi(t + 1) = -Xi(t) + -qd(xj(t)), 

and we deduce that 



[2 Xi (t + l)\ =[xi(t)\ +q d {x j (t)) 



n t (t) 


+ 


~nj(ty 


2 




2 



provided rii{t) = [2xi(t)\ as above in this paper. We argue a result which is similar to Corollary 3.3, but allows a 
larger deviation from the initial average. 

Theorem 5.1 Consider the algorithm (8) with quantizer q^. Then, there exists T con € Z>o such that, for allt > T con , 

\xi(t)-Xj(t)\<l Vi,je{l,...,iV} (37) 

and \\x(t) - x aV c(0)l||oo < 2. 

Proof: The inequality (37) can be derived as (17), since the symbolic dynamics is the same in both cases. 
Moreover, since for any integer k, it holds that [k/2\ + \k/2\ = k, we have that, given the definitions above, 
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rii(t + 1) + rij(t + 1) = rii(t) + nj(t) for any time t and any chosen pair Hence, the symbolic dynamics preserves 

the average of the symbolic states, even if the continuous dynamics does not. We then define n avc (t) = J^Lj rii(t), 
and remark that for any t > 0, 



r^ave(^) *^ave (^) 



since n avc (t) = n avc (0). Hence, for t > T c , 



1 1 

2N 



1 

~N 



i=l i=l 

i—\ 
N 

<-Y 

i=l 
1 - 

TV ^ 

i=l 
1 W 1 

= tf£9lL23i(t)J-2si(f)| 



-L2xi(t)J -x 4 (t) 



i=l 



1 



|»ave(*) - Save (0)| < 



^avc(^) 2^ avc (^) 



2«avc(0) - X avo (0) 



< 1. 



This in turn implies, together with (37), that \\x(t) — x ave (0)l||oo < 2. 



□ 



5.2 Probabilistic quantizer 

The symbolic analysis is again useful to provide a convergence result when the probabilistic quantizer is used, and 
then (8) becomes 

x i (t + l) = ^x i (t) + ^q p (x j (t)). (38) 

Theorem 5.2 Consider the algorithm (38). Then, it exists an integer random variable a, such that E[a] = a; avc (0) 7 
and almost surely 

lim Xi(t) = a, 

t — >oo 

for all i e {1, . ..,N}. 

Proof: We start our analysis by observing the following fact. Let and n = [2x\. If x — [x\ < 1/2, then 



q p {x) 



otherwise, if x — [x\ > 1/2, then 



r^l with probability \x~\ — x 

[^] + 1 with probability x — \x\ 



\t£~\ with probability x — [x\ 

— 1 with probability \x\ — x 
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Let now nj(i), for i £ {1, ... , N}, be equal to rii(t) = [2xi(t)\ . Recalling that [a^j (t)J = 
above and (38), it follows that the vector n(t) satisfies the following recursive equation 



»»(«) 

2 



from the observations 



«i(t+l) 



m(t) 


+ 




2 




2 


nj (t) 


+ 


m(t) 


2 




2 



ej(*), 
ei(t), 



where 



ek(t) 



1 with probability xit(i) — [a^fc (^)J 
with probability 1 — (xk(t) — L x fe(^)J) 
— 1 with probability 1 — (xk(t) — [xfcWJ) 
with probability Xfc(t) — L^fcWJ 



if nfc(t) is even 
if nk(t) is odd. 

By reintroducing the map g\ defined in (13), we can write in a compact way that 

(m (t + 1 ) , rij (t + 1 ) ) = g i (m (t ) , rij (t ) ) + ( ej (t ) , e, (t ) ) . 

Note that nfc(i) is even if and only if Xk(t) — [xk{t)\ < 1/2. Then, we have that F(ej(t) = 0) > h, and hence 
P [(rii(t + 1), rtj(t + 1)) = gi(rii(t), rij(i))] > \. Moreover, observe that, recalling the notations in the proof of Theo- 
rem 4.1, for all i G {1, ... j N} and t > 0, 

m(0) - r TO ( ) < nj(t) < M(0) + 1 - r M(0) . 

This invariance property and the above connection with c/i imply that almost surely it exists T con G Z>o such that 
n(t) belongs to the set A = {y £ 3q e Z such that y = 2al} defined in (31). 

To conclude the proof, we remark that, for any t > 0, and any i G {1, . . . , ./V}, 

2xi(t + 1) - L2x t (t + 1)J = ^(i) + qp(xj(t)) - [Xi(t) + q P (xj(t))\ = x t {t) - [x t (t)\. 

Equivalently, 2x i (t+l)-n i (t+l) = L^J. If now we assume t > T con , then we have shown that almost surely 

rii(t + 1) = n.i(t) = 2a, for all £ G {1, ... , N}. This fact implies that the equality above reduces to 2xt{t + 1) — 2a = 
Xi{t) — a, which in turns yields that 

Xi(t+1) = -(xi{t)+a). 

The latter deterministic dynamics implies that lnn t —>oa %i(t) = a, for all i G {1, . . . , N}. The argument is concluded 
recalling that the average is preserved in expectation. 



□ 



6 Discussion 



In this paper we studied the gossip algorithm for the consensus problem with quantized communication. We proposed 
three update rules, a totally quantized, a partially quantized and a compensating rule. In the first one, the agents 
use only quantized information in order to update their states. In the second one, they have access to exact (non- 
quantized) information regarding their own state, and use it for the update. In the third one, each agent uses both 
exact and quantized information about its own state, in order to ensure that the average of the states is globally 
preserved. This positive feature has the negative counterpart of preventing that consensus (all states be equal) be 
exactly reached. In all the algorithms we proposed, consensus can be approached up to the quantizer precision. 
Moreover, deviations from the average of the initial state are small, except in the case of the totally quantized rule 
with deterministic quantizer. We summarize these results in Table 1. In the remaining of this section, we discuss 
some extensions and potential developments of our results. 
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Totally Quantized 


Compensating 


Partially Quantized 




Finite time conv. to consensus 
Large error from average 


Finite time conv. to 
||a;-a;ave(0)l]|oo < 1/2 
Average preserved 


Finite time conv. to 

||x-Xave(0)l||oc < 2 

Average not preserved 


q P 


Finite time conv. to consensus 
Average preserved in expectation 


Asympt. conv. to 
N-V a y/E[\\x-x m {0)l\\%] < 1/2 
Average preserved 


Asympt. conv. to consensus 
Average preserved in expectation 



Table 1 

Summary of results. 

6.1 Uniform quantizers 



In Section 2 we introduced two significant quantizers, and q p . The present section provides a brief discussion about 
the generality of our results and about the motivations for our choices. Here we shall consider a class of uniform 
quantizers, that is maps q : R — > Z with the property that q(x) G 

Let us first consider deterministic uniform quantizers. The results given for q c i, namely Corollaries 3.3 and 4.2, and 
Theorem 5.1, can be extended to the rounding down x i— ► [^J an d the rounding up quantizer x \— > fx], defining 
suitable symbolic dynamics on n(t). Let us consider the rounding down, first. For the compensating strategy, the 
induced symbolic dynamics is 



9i{h,k) 



h 




k 




+ 




2 


2 



whereas for both the totally and the partially quantized strategy, it is 

g 5 {h,k) 

Their analysis is analogous to the one in Theorems 4.1 and 3.2, respectively. The analysis of the rounding up quantizer 
easily follows if we notice that \x] = — [— x\ . 

Second, we consider randomized uniform quantizers, which generalize the probabilistic quantizer q p . We observe that 
the proof of Theorem 4.3 is based on the fact that for all a G Z, 



q p (a+ 1/2) 



1 with probability p 
with probability 1 — p, 



(39) 



with p = 1/2. Actually, the argument is valid for any p G (0, 1), and hence Corollary 4.4 can be extended to any 
uniform quantizer satisfying Equation (39) with p G (0, 1). On the other hand, the mean squared error results in 
Section 3.2 are a direct consequence of the statistical properties of q p stated in Lemma 2.1: they thus hold for any 
quantizer sharing these properties. In particular, both for Section 3.2 and for Theorem 5.2, the randomized quantizer 
needs to be unbiased. The probabilistic quantizer q p is the only unbiased uniform quantizer, and hence it is the only 
one to which all our arguments apply. This justifies our choice of the specific probabilistic quantizer q p . 



6.2 Weighted averaging 



For simplicity reasons we have chosen to consider consensus gains equal to 1/2. Hence, although we believe that the 
case we studied already shows the significant features of the problem, it is natural to ask whether our results can be 
extended to general choices of weights, considering an update matrix 



P(t)=I-eE ij , (40) 

instead of the matrix (3). Such extension is likely to be possible, modulo solving some technical difficulties in the 
definition and analysis of the suitable symbolic dynamics. For instance, we conjectured in [6] that Corollary 3.3 apply 
to the compensating rule with deterministic quantizers and general update (40), provided e is a rational number in 
(0, 1/2]. This fact has actually been proven for any s G (0, 1/2] in the pair of recent conference papers [10,11]. 
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6.3 Speed of convergence 



Our paper has been devoted to prove convergence results for a set of algorithms. After proving convergence, the 
second analysis issue is studying the speed of convergence of the algorithms or, equivalently, the time needed to 
reach (or to approach in a suitable sense) their limit states. The non-quantized gossip algorithm [2] is known to 
asymptotically converge (in a mean squared sense) at exponential speed, with a rate which depends on the matrix 
W. It is thus natural to conjecture that the convergence of the quantized version be roughly exponential, as long as 
the differences among states are much larger than the quantization step: results in this sense have been given in [6] . 
This belief is confirmed by simulations, as those we report in Figure 2. 




Fig. 2. Evolution of the squared distance from the initial average for the three algorithms presented in the paper, compared 
with the standard gossip algorithm. Average of 10 runs on a geometric graph of iV = 20 nodes. Random uniform initial 
condition with Xi(0) £ [-100, 100] for all i € {1, ... , TV}. 

However, as we have shown in this paper, the granularity effects eventually come out in the convergence, making the 
systems converge in finite time to some limit set. Upper bounds on this convergence time can be found by the Markov 
chains techniques used in [9], and recently in [11] and [18]. Such bounds are usually very conservative in terms of 
their dependence on the number of agents: for this reason, we do not pursue such analysis here. In our opinion, it 
would be interesting to better understand the transition in the behavior of the algorithm: what is missing is a sharp 
result quantifying when the granularity effects become non-negligible, and which is the speed of convergence of the 
system during such transition. 
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